ABSTRACT

The rate of convergence of the expected value of quantile estimates using a stochastic approximation with the maximum transformation is evaluated. The analysis is performed using linear regression techniques on computer simulation results for quantile estimates of the unit exponential distribution. Included is a discussion on the use of the jackknife technique to reduce the bias of the stochastic approximation quantile estimates. Simulation results for the 2-fold jackknife for the $m^{-0.5}$ term are presented. The main conclusion of the analysis is that the lowest order term in the expression for the expected value of the quantile estimate as a function of sample size decreases as $m^{-0.25}$.
I. INTRODUCTION


The problem of estimating quantiles may be found in numerous statistical applications. (The $\alpha$ quantile $S_\alpha$ of the distribution $F(s)$ of the random variable $S$ is defined as $S_\alpha = F^{-1}(\alpha)$, $0 < \alpha < 1$, and is assumed to be unique.) The increased use of computer system simulations provides some practical examples. In certain cases the quantile estimates from the generated data may be the direct measures used to evaluate the system. In other applications one might find it practical to characterize the distribution function of variables at different points in a system using several statistics such as quantiles. The practicality of the latter example may be dictated by the size of the simulation and the number of replications performed. In any event, efficient quantile estimates are needed.

Two possible solutions to the problem of quantile estimation are the order statistic estimate $\hat S_\alpha(m)$ and the stochastic approximation estimate $\tilde S_\alpha(m)$ of Robbins-Monro (1951), where m is the sample size. These methods each have their benefits and limitations which warrant comparison. In this regard, the increased use of computer simulations dictates that the comparative efficiency of these methods not be based solely on statistical efficiency but on computational efficiency as well. The term computational efficiency refers to the time and cost of programming, debugging, computing, and data storage requirements in the application of a given estimation scheme.

Goodman, Lewis, and Robbins (1973) used these considerations in a comparative analysis of the order statistic estimate and the stochastic approximation estimate with the maximum (minimum) transformation in estimating extreme quantiles (e.g., quantiles corresponding to probability levels $\alpha = 0.999$). The maximum transformation is a device which they utilized to alleviate the problem of the apparently very slow convergence of the stochastic approximation when applied to extreme quantiles. They also indicated the possible existence of similar problems in highly skewed distributions even for moderate quantiles.

The considerations for selecting the maximum transformation above other possible techniques are discussed by Goodman, Lewis, and Robbins (1973). The benefits are that it does not require special information about the random variable from which the sample is taken to estimate the quantile. This kind of information would be required in deriving bounds to the quantile estimate to alleviate the problem of very slow convergence. The maximum transformation is also computationally efficient in that it requires a single memory cell and a limited number of binary comparisons. These requirements are much simpler and less time consuming than the scheme proposed by Kesten (1958) to alleviate the problem of long runs. Finally, if the maximum transformation to the median ($\alpha' = 0.5$) is used, Cochran and Davis (1965) have shown (using simulations) that the stochastic approximation works well for this value of $\alpha'$. The maximum transformation is used to transform upper quantiles $\alpha > 0.5$ to $\alpha' = 0.5$ and the minimum transformation is used where $\alpha < 0.5$.

The conclusions of Goodman, Lewis, and Robbins (1973) were that the stochastic approximation with maximum transformation provides a practical alternative to the problem of extreme quantile estimation. It is computationally efficient; computing time is linearly dependent on sample size (m) and storage requirements are small and fixed. Statistically, the estimate was relatively free of bias although there existed some increase in the variance of the estimate. Also included in their report were the results of the application of the jackknife for a presumed $m^{-1}$ term in the bias to the stochastic approximation scheme. Their simulation results indicated that convergence was slower than $m^{-1}$ and they conjectured the possible existence of an $m^{-0.5}$ bias term. This implied the possibility of further improving the stochastic approximation with maximum transformation using the appropriate jackknifing procedure. It was this conjecture that prompted the analysis for this paper.





II. PROBLEM AND APPROACH


The problem approached was to evaluate the rate of convergence of the stochastic approximation with a maximum transformation with the intent to further improve the stochastic approximation with the jackknife method of bias reduction proposed by Quenouille (1956). A secondary problem was to evaluate Tukey's assertion that the sample variance of the "pseudo-values" used in forming the jackknifed estimate could be used to estimate the variance of the estimate, i.e., $\mathrm{Var}(\tilde S_\alpha(m))$.

The approach taken to solve the problem consisted of two basic steps. Initially, a preliminary analysis was made on the data obtained by Goodman, Lewis, and Robbins (1973) to determine if the bias was of the order $O(m^{-0.5})$ rather than $O(m^{-1})$. The linear least squares computer program of Daniel and Wood (1971) was used to accomplish this step. Once the order of the bias term was determined, a computer simulation was used to evaluate the effect of the jackknife on the rate of convergence and the variance of the stochastic approximation estimate with maximum transformation and to determine if the sample variance of the pseudo-values could be used to estimate the variance of the quantile estimate. The distributions and quantiles selected for analysis were the unit exponential 0.5, 0.9, 0.95, 0.975, and 0.999 quantiles and the uniform (0,1) 0.5 quantile. Initially, the simulation was programmed to achieve an m-fold jackknifed estimate but was subsequently modified to the 2-fold jackknife for reasons discussed later in the paper.





III. LINEAR REGRESSION


The preliminary analysis in evaluating the rate of 
convergence of the stochastic approximation with maximum 
transformation was accomplished using a regression analysis 
on the original data obtained by Goodman, Lewis, and 
Robbins (1973). This data consisted of 12 points covering 
reduced sample size values from m' = 8 to 60, where 
m' = m/v and v is a constant computed for the maximum 
transformation, with corresponding values of the mean and 
variance of the stochastic approximation estimates with 
maximum transformation both with and without jackknife. 

The data used for the regression analysis was for the 0.999 
quantile estimates of the unit exponential distribution. 

Two regression equations were fitted to the data to substantiate the existence of a term of lower order than $m^{-1}$. In each case a weighted linear regression was used to account for the variability of the variance over the range of sample values. The weighting factor used was the inverse of the sample variance.

The initial regression was applied based on a logarithmic transformation. The regression equation used was

$$\ln(\text{bias}) = \ln(a) - \gamma \ln(m'), \tag{3.1}$$

where the values of the bias were computed as

$$b = E(\tilde S_\alpha(m')) - S_\alpha \tag{3.2}$$

and

$$b_J = E(\tilde S_{\alpha,J}(m')) - S_\alpha \tag{3.3}$$

for the data without and with jackknife respectively.

If the bias is such that $b = a m^{-\gamma}$ and higher order terms can be neglected, then the coefficients obtained from the regression will provide a direct estimate of $\gamma$. An obvious problem exists if the bias contains secondary terms that are significant. In that case the estimate $\hat\gamma$ will be incorrect since the effects of secondary terms are not distinguishable under the logarithmic transformation. Consequently, a second equation was also applied using multiple independent variables to check the significance of any secondary terms.
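The logarithmic fit of equation 3.1 can be illustrated with a minimal sketch. It is not the Daniel and Wood program; the function name `fit_log_bias` and the synthetic bias values are assumptions made only for illustration, and the bias values must be positive for the logarithm to exist.

```python
import math

def fit_log_bias(sizes, biases, weights=None):
    """Weighted least-squares fit of ln(bias) = ln(a) - gamma * ln(m).

    Returns (a, gamma). All bias values must be positive.
    """
    if weights is None:
        weights = [1.0] * len(sizes)
    x = [math.log(m) for m in sizes]
    y = [math.log(b) for b in biases]
    total = sum(weights)
    x_bar = sum(w * xi for w, xi in zip(weights, x)) / total
    y_bar = sum(w * yi for w, yi in zip(weights, y)) / total
    sxx = sum(w * (xi - x_bar) ** 2 for w, xi in zip(weights, x))
    sxy = sum(w * (xi - x_bar) * (yi - y_bar)
              for w, xi, yi in zip(weights, x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return math.exp(intercept), -slope

sizes = [8, 16, 32, 64]   # illustrative sample sizes only

# A single-term bias is recovered exactly.
a_hat, gamma_hat = fit_log_bias(sizes, [2.0 * m ** -0.5 for m in sizes])

# With a secondary m**-1 term present, the fitted exponent drifts
# away from 0.5, which is the problem described in the text.
_, gamma_mixed = fit_log_bias(sizes, [2.0 * m ** -0.5 + 90.0 / m for m in sizes])
```

The second call shows why a single fitted exponent cannot separate two terms: the fit returns one value lying between the two true exponents.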


The second regression equation used was

$$\text{bias} = a\,m'^{-\gamma} + b\,m'^{-2\gamma}, \tag{3.4}$$

where $\gamma$ is considered a known constant to preclude the requirement for a non-linear regression. The inherent problem with applying equation 3.4 with a linear regression is that an estimate or guess of $\gamma$ is required. To achieve this estimate the results of the initial regression in conjunction with the conjecture that $\gamma = 0.5$ were evaluated to derive an estimate for implementing the regression using Equation 3.4.
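Once $\gamma$ is fixed, a two-term bias model is linear in its coefficients, so ordinary weighted least squares applies. The sketch below assumes a model with terms in $m^{-\gamma}$ and $m^{-2\gamma}$ and no intercept; the name `fit_two_term` and the synthetic data are hypothetical and only illustrate the mechanics.

```python
def fit_two_term(sizes, biases, gamma=0.5, weights=None):
    """Weighted least squares for bias = a*m**(-gamma) + b*m**(-2*gamma).

    gamma is treated as known, so the 2x2 normal equations are solved directly.
    """
    if weights is None:
        weights = [1.0] * len(sizes)
    u = [m ** -gamma for m in sizes]
    v = [m ** (-2.0 * gamma) for m in sizes]
    suu = sum(w * ui * ui for w, ui in zip(weights, u))
    suv = sum(w * ui * vi for w, ui, vi in zip(weights, u, v))
    svv = sum(w * vi * vi for w, vi in zip(weights, v))
    suy = sum(w * ui * yi for w, ui, yi in zip(weights, u, biases))
    svy = sum(w * vi * yi for w, vi, yi in zip(weights, v, biases))
    det = suu * svv - suv * suv
    a = (suy * svv - svy * suv) / det
    b = (svy * suu - suy * suv) / det
    return a, b

# Synthetic, noise-free data built from two known coefficients.
sizes = [5600, 8400, 14000, 28000, 42000]
biases = [2.0 * m ** -0.5 + 90.0 / m for m in sizes]
a_hat, b_hat = fit_two_term(sizes, biases, gamma=0.5)
```

Passing the inverse sample variances as `weights` would correspond to the weighting described earlier in this section.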
IV. JACKKNIFE


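Based on the assumption that the bias of the quantile estimate $\tilde S_\alpha(m)$ is of the form

$$E(\tilde S_\alpha) - S_\alpha = a\,m^{-\gamma} + b\,m^{-2\gamma} + o(m^{-2\gamma}), \tag{4.1}$$

the jackknife technique can be applied to obtain an estimate $\tilde S_{\alpha,J}(m)$ with a bias free of the $m^{-\gamma}$ term:

$$E(\tilde S_{\alpha,J}) - S_\alpha = b'\,m^{-2\gamma} + o(m^{-2\gamma}). \tag{4.2}$$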


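The initial approach to applying the technique was with the m-fold jackknife. Let $\tilde S_\alpha(m)$ be the estimate based on all m sample values and $\tilde S_\alpha(m-1)_i$, $i = 1, \dots, m$, be the estimate based on m-1 sample values obtained by omitting the $i$th value. Then consider the pseudo-values $\tilde S_{\alpha,J}(m)_i$, $i = 1, \dots, m$, computed as

$$\tilde S_{\alpha,J}(m)_i = A\,\tilde S_\alpha(m) + B\,\tilde S_\alpha(m-1)_i, \quad i = 1, \dots, m. \tag{4.3}$$

The jackknifed estimate $\tilde S_{\alpha,J}$ is then computed as the average of the pseudo-values:

$$\tilde S_{\alpha,J}(m) = \sum_{i=1}^{m} \tilde S_{\alpha,J}(m)_i / m. \tag{4.4}$$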


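From equations 4.1, 4.3, and 4.4, if A and B are selected such that

$$A = \frac{m^{\gamma}}{m^{\gamma} - (m-1)^{\gamma}} \quad\text{and}\quad B = -\frac{(m-1)^{\gamma}}{m^{\gamma} - (m-1)^{\gamma}}, \tag{4.5}$$

then the jackknifed estimate will have a bias which is $O(m^{-2\gamma})$ or

$$E(\tilde S_{\alpha,J}) - S_\alpha = -\frac{b}{m^{\gamma}(m-1)^{\gamma}} + o(m^{-2\gamma}). \tag{4.6}$$

In the present case where $\gamma$ is assumed to be 0.5 the values of A and B used in the simulation were

$$A = \frac{\sqrt{m}}{\sqrt{m} - \sqrt{m-1}} \quad\text{and}\quad B = -\frac{\sqrt{m-1}}{\sqrt{m} - \sqrt{m-1}}. \tag{4.7}$$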


The 2-fold jackknife is computed with the modification that the sample is partitioned into two disjoint sections with $\tilde S_\alpha(m/2)_i$, $i = 1, 2$, computed from the $i$th section of half the sample values. Equations 4.3 and 4.4 then reduce to

$$\tilde S_{\alpha,J}(m)_1 = A\,\tilde S_\alpha(m) + B\,\tilde S_\alpha(m/2)_1, \tag{4.8}$$

$$\tilde S_{\alpha,J}(m)_2 = A\,\tilde S_\alpha(m) + B\,\tilde S_\alpha(m/2)_2, \tag{4.9}$$

and

$$\tilde S_{\alpha,J}(m) = \frac{1}{2}\sum_{i=1}^{2} \tilde S_{\alpha,J}(m)_i. \tag{4.10}$$

There is a problem here in that it is possible to segment the sample into two disjoint sections in many ways; in what follows the sections were the first and second halves of the data.

The values of A and B required to achieve the reduction in bias then become

$$A = \frac{2^{\gamma}}{2^{\gamma} - 1} \quad\text{and}\quad B = -\frac{1}{2^{\gamma} - 1}. \tag{4.11}$$
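The 2-fold procedure can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the simulation program: `two_fold_jackknife` and `toy_estimator` are hypothetical names, the coefficients are the ones that satisfy $A + B = 1$ and $A + B\,2^{\gamma} = 0$, and the halves are the first and second halves of the data.

```python
def two_fold_jackknife(sample, estimator, gamma=0.5):
    """Average of the two pseudo-values A*S(m) + B*S(m/2)_i.

    estimator is any function mapping a list of values to an estimate.
    """
    half = len(sample) // 2
    a = 2.0 ** gamma / (2.0 ** gamma - 1.0)
    b = -1.0 / (2.0 ** gamma - 1.0)
    full_estimate = estimator(sample)
    half_estimates = (estimator(sample[:half]), estimator(sample[half:]))
    pseudo_values = [a * full_estimate + b * h for h in half_estimates]
    return sum(pseudo_values) / 2.0

def toy_estimator(values):
    """Artificial estimator: target 1.0 plus a pure 3*m**-0.5 bias."""
    return 1.0 + 3.0 * len(values) ** -0.5

# The m**-0.5 bias is removed exactly for this artificial estimator.
corrected = two_fold_jackknife(list(range(16)), toy_estimator, gamma=0.5)
```

The toy estimator depends only on the sample size, which isolates the algebra of the bias removal from any sampling noise.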
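The resulting jackknifed estimate then is biased as $O(m^{-2\gamma})$ or specifically

$$E(\tilde S_{\alpha,J}) - S_\alpha = \frac{2^{\gamma} - 2^{2\gamma}}{2^{\gamma} - 1}\, b\, m^{-2\gamma} + o(m^{-2\gamma}) = -2^{\gamma}\, b\, m^{-2\gamma} + o(m^{-2\gamma}). \tag{4.12}$$

Based on the assumption $\gamma = 0.5$ the values for A and B are

$$A = \frac{\sqrt{2}}{\sqrt{2} - 1} \quad\text{and}\quad B = -\frac{1}{\sqrt{2} - 1}, \tag{4.13}$$

with the result that the jackknifed estimate has the form

$$E(\tilde S_{\alpha,J}) = S_\alpha - \sqrt{2}\, b\, m^{-1} + o(m^{-1}). \tag{4.14}$$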


These calculations indicate that if the jackknife is performed for the correct term, the resulting estimate would converge more rapidly but with a negative bias. A logical question that arises is what are the consequences of applying the jackknife for an erroneous term, say $m^{-\beta}$. Given that the bias is of the form in equation 4.1 and applying the jackknife with $A = 2^{\beta}/(2^{\beta} - 1)$ and $B = -1/(2^{\beta} - 1)$, the resulting estimate will have the form

$$E(\tilde S_{\alpha,J}) = S_\alpha + a\,\frac{2^{\beta} - 2^{\gamma}}{2^{\beta} - 1}\, m^{-\gamma} + O(m^{-2\gamma}). \tag{4.15}$$

This result indicates that if the true bias is of the form $m^{-\gamma}$ such that $\gamma > \beta$, where $\beta$ is the power of the term being jackknifed, the resulting estimator should remain biased with leading term still decreasing as $m^{-\gamma}$ but with a coefficient $k = (2^{\beta} - 2^{\gamma})/(2^{\beta} - 1)$. Similarly, if $\gamma$ and $\beta$ are such that $\gamma < \beta$ and $c\gamma = \beta$ where c is some positive integer then

$$E(\tilde S_{\alpha,J}) = S_\alpha + ka/m^{\gamma} + O(m^{-2\gamma}) - O(m^{-\beta}). \tag{4.16}$$


Consider the case where $\gamma = 0.5$ and the jackknife is applied for the $m^{-1}$ term ($\beta = 1$). The resulting estimator will be biased and of the form

$$E(\tilde S_{\alpha,J}) = S_\alpha + .586\, a\, m^{-0.5} + o(m^{-1}). \tag{4.17}$$

Thus there is a reduction in the magnitude of the $m^{-0.5}$ term in the bias, but the leading term is still of the same order as before.
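The effect of jackknifing for the wrong term can be checked numerically. The sketch below assumes the 2-fold coefficients $A = 2^{\beta}/(2^{\beta}-1)$ and $B = -1/(2^{\beta}-1)$; the function name `surviving_factor` is hypothetical. A bias term $c\,m^{-\delta}$ contributes $c\,m^{-\delta}$ to the full-sample estimate and $c\,(m/2)^{-\delta}$ to each half-sample estimate, so it is multiplied by $A + B\,2^{\delta}$.

```python
def surviving_factor(beta, delta):
    """Multiplier left on a bias term c*m**(-delta) after a 2-fold
    jackknife designed to remove an m**(-beta) term."""
    a = 2.0 ** beta / (2.0 ** beta - 1.0)
    b = -1.0 / (2.0 ** beta - 1.0)
    return a + b * 2.0 ** delta

k_targeted = surviving_factor(0.5, 0.5)   # the targeted term vanishes
k_next = surviving_factor(0.5, 1.0)       # -sqrt(2): negative bias in the m**-1 term
k_wrong = surviving_factor(1.0, 0.5)      # about 0.586: m**-0.5 term shrinks but survives
```

The last value agrees with the coefficient .586 quoted for the case $\gamma = 0.5$, $\beta = 1$.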

Casual inspection of the simulation results in Goodman, Lewis, and Robbins (1973) indicates that the bias is decreasing slower than $m^{-1}$. If the leading term is in fact $m^{-0.5}$, the effect of the jackknife is that given in Equation 4.17. The bias results obtained by Goodman, Lewis, and Robbins (1973) are shown in Figure 1.

In the next sections the two types of quantile estimation are discussed in more detail.
FIGURE 1. Exact bias for the order statistic estimate for the unit exponential distribution, and estimated bias for the stochastic approximation estimators with and without jackknife, using the maximum transformation (v = 700) and stochastic approximation. Jackknife for the $m^{-1}$ term. Horizontal axis: log sample size, ln(m). Vertical axis: bias, with tick marks at 0.005, 0.010, and 0.020.
V. QUANTILE ESTIMATE: ORDER STATISTIC


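The order statistic estimate $\hat S_\alpha$ can be obtained by ordering the sample and using $S_{([\alpha m])}$ as the order statistic estimate $\hat S_\alpha$, where $[\alpha m]$ is the greatest integer less than or equal to $\alpha m$ and $S_{(1)} \le S_{(2)} \le \dots \le S_{(m)}$ are the ordered sample values. The distribution and density function of the $i$th value of the ordered sample are:

$$F_{(i)}(s) = \sum_{j=i}^{m} \binom{m}{j} [F(s)]^{j}\,[1 - F(s)]^{m-j}, \tag{5.1}$$

$$f_{(i)}(s) = i \binom{m}{i} [F(s)]^{i-1}\,[1 - F(s)]^{m-i}\, f(s). \tag{5.2}$$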


For the uniform distribution (0,1) the mean and variance of the order statistic then become

$$E(S_{(i)}) = i/(m+1) \tag{5.3}$$

and

$$\mathrm{Var}(S_{(i)}) = i(m-i+1)/\{(m+1)^2 (m+2)\}. \tag{5.4}$$

If the sample size m is such that $\alpha m$ is equal to an integer, then $\hat S_\alpha = S_{([\alpha m])}$ and equations 5.3 and 5.4 become

$$E(\hat S_\alpha) = \alpha m/(m+1) \tag{5.5}$$

and

$$\mathrm{Var}(\hat S_\alpha) = \alpha m\,(m - \alpha m + 1)/\{(m+1)^2 (m+2)\}. \tag{5.6}$$

Results using equations 5.5 and 5.6 are tabulated in Table 1 for the uniform (0,1) 0.5 quantile.
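As a worked instance of the uniform (0,1) formulas, the mean and variance of the $i$th order statistic can be evaluated exactly with rational arithmetic. The function names below are hypothetical; the formulas are the standard Beta$(i, m-i+1)$ moments.

```python
from fractions import Fraction

def uniform_order_statistic_moments(m, i):
    """Mean and variance of the i-th smallest of m uniform (0,1) values."""
    mean = Fraction(i, m + 1)
    variance = Fraction(i * (m - i + 1), (m + 1) ** 2 * (m + 2))
    return mean, variance

def median_estimate_moments(m):
    """Bias and variance of the 0.5 quantile estimate S_(m/2), m even."""
    if m % 2:
        raise ValueError("m must be even so that 0.5 * m is an integer")
    mean, variance = uniform_order_statistic_moments(m, m // 2)
    return mean - Fraction(1, 2), variance

# For m = 10 the estimate S_(5) has bias -1/22 and variance 5/242.
bias_10, variance_10 = median_estimate_moments(10)
```

The bias is of order $m^{-1}$, which is the rate against which the stochastic approximation estimate is compared.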
Cox and Lewis (1966), using direct methods, arrived at the following results for the unit exponential distribution:

$$E(\hat S_\alpha) = -\ln(1-\alpha) - \frac{\alpha}{2(1-\alpha)}\, m^{-1} + O(m^{-2}) \tag{5.7}$$

and

$$\mathrm{Var}(\hat S_\alpha) = \frac{\alpha}{1-\alpha}\, m^{-1} + O(m^{-2}). \tag{5.8}$$

These equations were used to compute the results in Tables 2 through 6 for the unit exponential 0.5, 0.9, 0.95, 0.975, and 0.999 quantiles.
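For the unit exponential distribution the moments of an order statistic can also be written as finite sums, which gives a check on any large-sample expression. The sketch below relies on the standard representation of exponential order statistics as sums of independent scaled exponentials (brought in here as an assumption, not taken from Cox and Lewis); the function names are hypothetical and the index is $i = \alpha m$.

```python
import math

def exact_mean(m, i):
    """Exact mean of the i-th smallest of m unit exponential values."""
    return sum(1.0 / (m - j) for j in range(i))

def exact_variance(m, i):
    """Exact variance of the i-th smallest of m unit exponential values."""
    return sum(1.0 / (m - j) ** 2 for j in range(i))

def leading_order_mean(m, alpha):
    """-ln(1 - alpha) plus the m**-1 term implied by the exact sum."""
    return -math.log(1.0 - alpha) - alpha / (2.0 * (1.0 - alpha) * m)

def leading_order_variance(m, alpha):
    """Leading m**-1 term of the variance."""
    return alpha / ((1.0 - alpha) * m)

m, alpha = 1000, 0.9
i = 900                      # alpha * m, an integer for these values
mean_gap = exact_mean(m, i) - leading_order_mean(m, alpha)
var_gap = exact_variance(m, i) - leading_order_variance(m, alpha)
```

Both gaps are of order $m^{-2}$, so the leading-order expressions are adequate at the sample sizes considered here.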
VI. QUANTILE ESTIMATE: STOCHASTIC APPROXIMATION


The maximum transformation was used with the stochastic approximation in estimating extreme quantiles to alleviate the problem of slow convergence, as previously discussed. To implement the maximum transformation consider the sample $S_1, \dots, S_m$ from the distribution $F(s) = \mathrm{prob}(S \le s)$ from which the extreme quantile $S_\alpha$ is to be estimated. If the sample is partitioned into subsets of v values each (assuming $m' = m/v$) and the maximum value is taken from each of the $m'$ subsets the result is a reduced sample $S'_1, \dots, S'_{m'}$. The distribution $F'(s)$ of the random variable in the reduced sample is then given by

$$F'(S_\alpha) = F^{v}(S_\alpha) = \alpha^{v} = \alpha' \tag{6.1}$$

and consequently $S'_{\alpha'} = S_\alpha$.

This result implies that if the transformation to the median $\alpha' = 0.5$ is used, and v is chosen such that $v = \ln(\alpha')/\ln(\alpha)$, then the stochastic approximation can be used to estimate the median $S'_{\alpha'} = S'_{0.5}$ of the reduced sample $S'_1, \dots, S'_{m'}$, which is equal to the extreme quantile $S_\alpha$ from the original sample.
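A short sketch of the maximum transformation follows. The function names are hypothetical, and the handling of an incomplete trailing subset (it is dropped) is an assumption made for the illustration. Because v must be an integer in practice, the transformed level $\alpha' = \alpha^{v}$ is only approximately 0.5.

```python
import math

def subset_size(alpha, alpha_prime=0.5):
    """Real-valued v = ln(alpha') / ln(alpha)."""
    return math.log(alpha_prime) / math.log(alpha)

def transformed_level(alpha, v):
    """Probability level alpha' = alpha**v of S_alpha in the reduced sample."""
    return alpha ** v

def reduce_by_maximum(sample, v):
    """Maxima of consecutive subsets of v values (incomplete tail dropped)."""
    usable = len(sample) - len(sample) % v
    return [max(sample[k:k + v]) for k in range(0, usable, v)]

v_exact = subset_size(0.999)              # a little under 693
level_700 = transformed_level(0.999, 700) # level reached with v = 700
reduced = reduce_by_maximum([3, 1, 2, 9, 4, 5, 7], 3)
```

Each subset maximum needs only one stored value and v - 1 comparisons, which is the source of the small, fixed storage requirement.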
The primary distribution used in the simulation was the unit exponential distribution, $F(s) = 1 - e^{-s}$. To minimize the computing time of the program the maximum transformation was simulated by generating the reduced sample $S'_1, \dots, S'_{m'}$ directly. Since

$$F'(s) = (1 - e^{-s})^{v} = r, \quad 0 \le r \le 1, \tag{6.2}$$

solving for s gives

$$s = -\ln(1 - r^{1/v}). \tag{6.3}$$

Hence, $m'$ pseudo-random numbers $r_i$, $i = 1, \dots, m'$, were generated and the reduced sample $S'_1, \dots, S'_{m'}$ was computed directly from equation 6.3 (i.e., $S'_i = -\ln(1 - r_i^{1/v})$, $i = 1, \dots, m'$). This procedure saved considerable computing time, if v is much greater than 10, by reducing the number of pseudo-random numbers required from m down to $m' = m/v$. The values of the parameters used in the simulation are listed in the following table.


    α             0.9       0.95      0.975     0.999
    v             7         14        27        700
    α' = α^v      0.47830   0.48768   0.50481   0.49641
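The direct generation of the reduced sample through equation 6.3 can be sketched as follows. This is not the original program: it uses Python's standard generator instead of the generator described in the footnote, the function names are hypothetical, and the logarithm is rearranged with `math.expm1` so that $1 - r^{1/v}$ keeps its precision when v is large.

```python
import math
import random

def reduced_sample_direct(m_prime, v, rng):
    """Draw m' maxima of v unit exponential values, one uniform number each.

    Implements s = -ln(1 - r**(1/v)) as -ln(-expm1(ln(r) / v)).
    """
    sample = []
    while len(sample) < m_prime:
        r = rng.random()
        if r == 0.0:          # ln(0) is undefined; redraw this rare value
            continue
        sample.append(-math.log(-math.expm1(math.log(r) / v)))
    return sample

def fraction_at_or_below(sample, threshold):
    """Empirical distribution function of the sample at the threshold."""
    return sum(1 for s in sample if s <= threshold) / len(sample)

rng = random.Random(2024)            # fixed seed, for repeatability only
alpha, v = 0.9, 7
s_alpha = -math.log(1.0 - alpha)     # true 0.9 quantile of the unit exponential
sample = reduced_sample_direct(20000, v, rng)
observed_level = fraction_at_or_below(sample, s_alpha)   # near alpha**v
```

The observed level should be close to $\alpha^{v}$, confirming that $S_\alpha$ sits near the median of the reduced sample.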


The stochastic approximation was then applied to the reduced sample to obtain the estimate $\tilde S_{\alpha'}(m')$, which is the value obtained on the $m'$th iteration using the following equation:

$$\tilde S_{\alpha'}(i) = \tilde S_{\alpha'}(i-1) - \frac{C}{i}\,\{\mathrm{sgn}(\tilde S_{\alpha'}(i-1) - S'_i) - \alpha'\}, \quad i = 1, \dots, m', \tag{6.4}$$

where the function sgn(x) assumes the value 1 if x > 0 and 0 if x ≤ 0, C is the inverse of the density $f'(S'_{\alpha'})$, and $\tilde S_{\alpha'}(0)$ is an arbitrary starting value.

¹ The random number generator, RANDOM, used was an adaptation of the generator reported in Lewis, P.A.W., Goodman, A.S., and Miller, J. M., "A Pseudo-Random Number Generator for the System/360," IBM Systems Journal, v.8, no. 2, 1969. The generator was modified by G. P. Learmonth to produce double precision pseudo-random numbers and a shuffling scheme implemented to improve the randomness of these numbers. Copies of the program may be obtained from G. P. Learmonth.

The initial values of C and $\tilde S_{\alpha'}(0)$ were obtained using the first three sample values $S'_1$, $S'_2$ and $S'_3$ from the reduced sample. These values were ordered to get $S'_{(1)}$, $S'_{(2)}$ and $S'_{(3)}$. The following approximations were then used for C and $\tilde S_{\alpha'}(0)$:

$$C = \frac{8\,[(S'_{(2)} - S'_{(1)})(S'_{(3)} - S'_{(2)})]}{S'_{(3)} - S'_{(1)}} \quad\text{and}\quad \tilde S_{\alpha'}(0) = S'_{(2)}. \tag{6.5}$$
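The recursion itself is short. The sketch below is the standard Robbins-Monro quantile recursion written with the 0/1 convention for sgn used in the text; it is a hedged illustration, not the original program. The names are hypothetical, and the gain is set from the known density at the median rather than estimated from the first three values.

```python
import math
import random

def indicator(x):
    """The sgn(x) of the text: 1 if x > 0, otherwise 0."""
    return 1.0 if x > 0 else 0.0

def stochastic_approximation(sample, alpha_prime, gain, start):
    """Robbins-Monro recursion for the alpha' quantile.

    estimate(i) = estimate(i-1)
                  - (gain / i) * (sgn(estimate(i-1) - s_i) - alpha')
    Only the current estimate is stored, whatever the sample size.
    """
    estimate = start
    for i, value in enumerate(sample, start=1):
        estimate -= (gain / i) * (indicator(estimate - value) - alpha_prime)
    return estimate

rng = random.Random(7)                                  # fixed seed for repeatability
sample = [rng.expovariate(1.0) for _ in range(20000)]   # unit exponential values
true_median = math.log(2.0)
gain = 1.0 / 0.5     # 1 / f(median); the density at ln 2 is exp(-ln 2) = 0.5
estimate = stochastic_approximation(sample, 0.5, gain, start=1.0)
```

Each observation costs one comparison and one update, so the running time is proportional to the sample size and no sorting is needed.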





VII. RESULTS AND CONCLUSIONS 


The results of the linear regression on the 0.999 quantile estimates for the unit exponential distribution without jackknife in Goodman, Lewis, and Robbins (1973) provided some evidence that the term of lower order than $m^{-1}$ did exist. The results using equation 3.1 were

$$\text{bias} = 20.4\, m^{-0.697} \tag{7.1}$$

and from equation 3.4 the results were

$$\text{bias} = 2.328\, m^{-0.5} + 93.67\, m^{-1}. \tag{7.2}$$

From equation 7.2 it was apparent that the $m^{-0.5}$ term was significant in the bias of the estimate even though the value $\gamma = 0.5$ was based on a conjecture only partially substantiated by the results in equation 7.1. The result that $\hat\gamma = 0.697$ can be partially accounted for by the fact that the $m^{-1}$ term is still a significant factor at the sample values considered. The results of the linear regression on the jackknifed data in Goodman, Lewis, and Robbins (1973) did not provide any additional insight into the rate of convergence of the quantile estimate.

Although these results were not conclusive in establishing the exact order of the bias, they did tend to support the conjecture that the higher order term was $m^{-0.5}$. It was therefore decided to extend $m'$ out to a value of 200 and jackknife for an $m^{-0.5}$ term.
The simulation was initially programmed to achieve an m-fold jackknife. However, results of the simulation were that the variance of the estimate tended to increase above the theoretical level. The variance of the jackknifed estimate for the 0.999 quantile of the unit exponential distribution was as much as 100 times higher than that of the stochastic approximation without the jackknife. These results are listed in Table 7 and may be compared with the final results for the 2-fold jackknife listed in Table 19 for the same quantile. It was thus concluded that without some kind of data transformation the m-fold jackknife was impractical. Consequently, the program was modified to investigate the 2-fold jackknife.

The variance reduction technique using control variables as discussed by Gaver (1969) was initially programmed into the simulation in an effort to reduce the run time required to achieve a reasonable value for the variance of the estimate. The control variable used was the number of sample values from the reduced sample which exceeded $S_\alpha$ divided by $m'$, the total number of sample values. The result was that a reduction of approximately 20 per cent in the variance of the estimate of the expected value of the quantile estimate without the jackknife was achieved. However, the effect on the variance of the estimate of the expected value of the jackknifed estimate was negligible.

Converting the program from the m-fold to the 2-fold jackknife precluded the evaluation of Tukey's idea of estimating the variance of the quantile estimate using the variance of the pseudo-values. This assertion is based on the premise that the distribution of the pseudo-values tends to normality and that the pseudo-values are approximately uncorrelated. However, results from the data obtained by Goodman, Lewis, and Robbins (1973) for the coefficients of skewness and excess of the estimates obtained via a stochastic approximation tend to negate the assumption of a normal distribution. These results are plotted in Figures 2 and 3. If, as the theory of stochastic approximation asserts, the asymptotic distribution is normal, the convergence is very slow. It would therefore appear that using the pseudo-values to estimate the variance would not be fruitful in the present application of the jackknife without some kind of normalizing transformation. It may be that the basic problem is the correlation between pseudo-values, not the lack of normality; but this has not been investigated.

The final simulation was programmed to compute the stochastic approximation with maximum transformation with and without the 2-fold jackknife for the $m^{-0.5}$ term. Results for the unit exponential 0.5, 0.9, 0.95, 0.975, and 0.999 quantiles and the 0.5 quantile of the uniform (0,1) distribution are listed in Tables 8 through 19 with plots of the estimated bias versus sample size. The remaining evaluation is based on the results obtained for the unit exponential distribution.
In each case the jackknife estimate was negatively biased as predicted from equation 4.14. However, evaluation of the 0.5 quantile data (Table 11) indicates a slower rate of convergence than the $m^{-1}$ implied by that equation. Further, successive regressions on the points in the tail of the curve indicate that the value of $\gamma$ is decreasing, possibly to $\gamma = 0.25$. Also, the sample bias obtained for the 0.5 quantile goes negative at a sample size value between 192 and 200, which negates the assumption that the coefficients in equation 3.4 are positive.

These results support the conjecture that an $m^{-0.5}$ term was significant in the bias. However, it also appears that there exists a lower order term, probably of the order $m^{-0.25}$. This would imply that the bias may be of the form

$$\text{bias} = a\,m^{-0.25} + b\,m^{-0.5} + c\,m^{-0.75} + d\,m^{-1} + O(m^{-1.25}). \tag{7.3}$$


A weighted linear regression using this equation was performed on the data for the unit exponential $\alpha = 0.5$ quantile without jackknife listed in Table 10. The resulting estimates for the leading four terms were a = -0.0501824, b = 0.0251160, c = 0.790984, and d = -0.667208. The values of the observed bias, the bias estimated from the equation fit by the weighted linear regression, and the residuals are listed in Table 20. The equation fit achieved a multiple correlation coefficient squared of .9999. Note that the simulation for this case is very large (390,000 replications) so that such a large value of $R^2$ is not surprising.
The conjecture that the bias is of the form in 7.3 implies that the bias computed as a result of the jackknife for the $m^{-0.5}$ term would be

$$\text{bias} = +.544\,a\,m^{-0.25} - .648\,c\,m^{-0.75} - 1.41\,d\,m^{-1} + O(m^{-1.25}), \tag{7.4}$$

which is consistent with the results obtained.


A jackknife for the $m^{-0.25}$ term would result in a bias of the form

$$\text{bias} = -1.19\,b\,m^{-0.5} - 2.609\,c\,m^{-0.75} - 4.29\,d\,m^{-1} + o(m^{-1}), \tag{7.5}$$

which would indicate that the magnitude of the resulting bias may be a large negative value and converge as $m^{-0.5}$. In fact, a pilot run to jackknife the $m^{-0.25}$ term gave results with a large negative bias and a rate of convergence of approximately that order. These results are plotted in Figure 10. The pilot run consisted of 120,000 replications for the 0.5 quantile of the unit exponential distribution.

An informal analysis of these results indicated a rate of convergence that appears to conform fairly well with the results predictable from equation 7.5. It would therefore appear that the form of the bias of the stochastic approximation with maximum transformation may be of the form in equation 7.3.

If the form of the bias is in fact of the form in 7.3, a possible approach would be to jackknife for both the $m^{-0.25}$ and $m^{-0.5}$ terms. This could be accomplished by using as pseudo-values

$$\tilde S_{\alpha,J}(m)_{ij} = A\,\tilde S_\alpha(m) + B\,\tilde S_\alpha(m/2)_i + C\,\tilde S_\alpha(m/4)_{ij}, \quad i = 1, 2;\ j = 1, 2, \tag{7.6}$$

where $\tilde S_\alpha(m)$ and $\tilde S_\alpha(m/2)_i$ are as previously discussed and $\tilde S_\alpha(m/4)_{ij}$ is computed by partitioning each set of m/2 values into two disjoint subsets of m/4 values each. Then $\tilde S_\alpha(m/4)_{ij}$ is the estimate based on the values in the $j$th subset from the $i$th half sample set.

The jackknifed estimate then becomes

$$\tilde S_{\alpha,J}(m) = \frac{1}{4}\sum_{i=1}^{2}\sum_{j=1}^{2} \tilde S_{\alpha,J}(m)_{ij}, \tag{7.7}$$

with the result that if A, B, and C are chosen as

$$A = 1 - B - C, \tag{7.8}$$

$$B = -\frac{2^{0.25} + 2^{0.5}}{(2^{0.25} - 1)(2^{0.5} - 1)}, \tag{7.9}$$

$$C = \frac{1}{(2^{0.25} - 1)(2^{0.5} - 1)}, \tag{7.10}$$

then

$$E(\tilde S_{\alpha,J}) - S_\alpha = 1.68\,c\,m^{-0.75} + 6.06\,d\,m^{-1} + O(m^{-1.25}), \tag{7.11}$$

which is void of the $m^{-0.25}$ and $m^{-0.5}$ bias terms. This may, of course, inflate the variance of the estimator to an unacceptable level.
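The coefficients for jackknifing two terms at once follow from three linear conditions: $A + B + C = 1$ and $A + B\,2^{\beta} + C\,4^{\beta} = 0$ for each of the two targeted exponents. The sketch below solves these in closed form and evaluates the multipliers left on the remaining terms; the function names are hypothetical, and the closed form is a derived equivalent of those conditions rather than a transcription of the original equations.

```python
def double_jackknife_coefficients(beta1=0.25, beta2=0.5):
    """A, B, C with A + B + C = 1 that cancel m**-beta1 and m**-beta2 terms."""
    x, y = 2.0 ** beta1, 2.0 ** beta2
    denom = (x - 1.0) * (y - 1.0)
    return x * y / denom, -(x + y) / denom, 1.0 / denom

def surviving_multiplier(coefficients, delta):
    """Multiplier left on a bias term c*m**(-delta) when the full, half and
    quarter sample estimates are combined with coefficients A, B and C."""
    a, b, c = coefficients
    return a + b * 2.0 ** delta + c * 4.0 ** delta

coefficients = double_jackknife_coefficients()
left_on_025 = surviving_multiplier(coefficients, 0.25)   # cancelled
left_on_050 = surviving_multiplier(coefficients, 0.5)    # cancelled
left_on_075 = surviving_multiplier(coefficients, 0.75)   # about 1.68
left_on_100 = surviving_multiplier(coefficients, 1.0)    # about 6.06
```

The coefficients themselves are large (A is above 20 and B below -30), which is consistent with the concern that this procedure may inflate the variance.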

In conclusion, an assessment of the utility of the method for obtaining an extreme quantile discussed in this thesis can be made by referring to Figures 11 and 12. These show the standard deviations and root mean square errors of the order statistic estimate and the stochastic approximation estimate with maximum transformation (with and without jackknife) as a function of the sample size.

There is clearly an increase in the standard deviation of the quantile estimate, for all m, as one goes from the estimate based on order statistics to the stochastic approximation estimate and then to the stochastic approximation with jackknife. However, this is more than offset by the (fixed) smaller memory requirements for the stochastic approximation estimate and the fact that the time needed to obtain an estimate from a sample of size m is proportional to m, while the order statistic estimate requires memory proportional to m and time (to order the data) proportional to m log(m). These factors become even more critical when considered in the context of estimating large numbers of quantiles or in cases where m is large (see Goodman, Lewis, and Robbins, 1973).